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Abstract — This paper gives a simplified proof for the existence 
of nested lattice codebooks that allow to achieve the capacity 
of the additive white Gaussian noise channel. The proof is self- 
contained and relies only on basic probabilistic and geometrical 
arguments. An ensemble of nested lattices which is different than 
the one used in previous proofs is introduced. This ensemble, in 
addition to giving rise to a simple proof, can be easily generalized 
to an ensemble of nested lattices chains. As a result, the proof 
technique given here easily extends to showing the existence of 
"good" chains of nested lattices. 

I. Introduction 

Linear codes codes have attracted much attention from 
researchers ever since the first days of Information Theory, 
as a mean of approaching the capacity of additive discrete 
memoryless channels (DMC) with a reasonable encoding 
and decoding complexity. Interestingly, lattice codes, which 
are their Euclidean counterparts, were first considered for 
communication over additive white Gaussian noise (AWGN) 
channels only in the mid 1970s by de Buda [ 1 1. The relatively 
late interest in lattice codes may be in part due to the power 
constrained nature of the AWGN channel, which requires an 
additional shaping procedure, that is not needed for linear 
codes over DMCs. 

Lattice codes are attractive candidates for coding over 
the AWGN channel due to their structure, which can be 
exploited by encoding and decoding procedures with reduced 
complexity compared to non-structured codes. The question as 
to whether such codes can achieve the capacity of the AWGN 
channel with lattice encoding and decoding was fully resolved 
in where a coding scheme based on nested lattice codes in 
conjunction with Wiener estimation was proposed. Using this 
scheme, which we describe in detail in Section IH-AI it was 
shown that for the AWGN channel, lattice codes with lattice 
encoding and decoding can achieve good error exponents. As 
a corollary, it followed that the AWGN channel's capacity is 
achievable with lattice encoding and decoding. In J3] it was 
shown that for rates sufficiently close to capacity (i.e., above 
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the critical rate of the channel) nested lattice codes actually 
achieve the optimal error exponent. 

In the last decade lattice codes were found to play a 
new role in Information Theory. Following the emergence of 
wireless communication as a leading technology, the interest 
in understanding the fundamental limits of Gaussian networks 
had grown. In such networks, there are multiple transmitters 
and receivers, where each receiver sees a linear combination 
of the signals sent by the different transmitters, corrupted by 
AWGN. Surprisingly, in networks of this type, coding schemes 
that utilize lattice codes often outperform the best known 
random coding schemes. Thus, lattice codes are used in order 
to obtain new achievable rate regions, rather than reducing 
complexity. 

As a consequence, the nested lattices coding scheme of (2) 
has been extensively used for proving new coding theorems for 
Gaussian networks. Since for the majority of these networks 
the capacity is not known, the question of finding the best error 
exponent is far out of scope. Therefore, when |2| is used in 
the context of Gaussian networks, it is usually the capacity 
result that is used and not the error exponent results. 

The aim of this paper is to make the capacity result from [2] 
more accessible. To this end, we derive this result directly, 
without going through error exponent analysis. This leads to 
a simplified proof, that is completely self contained and uses 
only elementary probabilistic and geometrical arguments. A 
major advantage of the proof technique is that it can be easily 
extended to more complicated structures of nested lattices. 

There are two main differences between the approach taken 
in this work and previous works. In previous work, a two- 
step construction was considered. In the first step, assuming 
a coarse lattice that is good for covering (and hence also for 
quantization) is given, a dense self-similar nested lattice pair 
is generated via scaling. In the second step, the fine lattice is 
diluted. In this paper we simultaneously construct the nested 
lattices by taking a chain of subcodes of one linear code. 
This is a direct extension of the construction of nested linear 
binary codes proposed by Zamir and Shamai in (4). Another 
difference from previous works is that rather than requiring 
that the coarse lattice will be good for covering, we only 
require that it will be good for quantization. 



II. Preliminaries on Lattice Codes 

A lattice A is a discrete subgroup of K™ which is closed 
under reflection and real addition. Any lattice A in M" is 
spanned by some n x n matrix F such that 

A = {t = Fa : a e Z n }. 

We denote the nearest neighbor quantizer associated with the 
lattice A by 



Qa(x) = argmin ||x - t| 



(1) 



The basic Voronoi region of A, denoted by V, is the set of all 
points in M™ which are quantized to the zero vector, where 
ties in (Q~|l are broken in a systematic manner. The modulo 
operation returns the quantization error w.r.t. the lattice, 

[x] mod A = x — Qa(x), 

and satisfies the distributive law, 

[[x] mod A + y] mod A = [x + y] mod A. 

Let V(A) be the volume of a fundamental cell of A, e.g., the 
volume of V, and let U be a random variable uniformly dis- 
tributed over V. We define the second moment per dimension 
associated with A as 

^,A) 4 I E || Uf . ii^E. 
n n V(A) 

The normalized second moment (NSM) of a lattice A is 
defined by 

G(A) = 

Note that this quantity is invariant to scaling of the lattice A. 

It is often useful to compare the properties of the Voronoi 
region V with those of a ball. 

Definition 1: Let 

8(s,r)^{x€l" : ||x - s|| < r} , 

denote the closed n-dimensional ball with radius r centered 
at s. We refer to the volume of an n-dimensional ball with 
unit radius as V n . In general V (B(s,r)) — V n r n . Note that 
nVn n < 2ire for all n 0, and 



lim nVn = 2ne. 



(2) 



The ball B(0,r) has the smallest second moment per 

dimension out of all sets in K™ with volume V n r n , and it 
is given by 

1 1 



(B(0,r)) 



n V n r n 
1 1 



x£B(0,r) 



n V n r n J 
1 1 nV n r n+2 
n V n r n n + 2 

^2 



r' 2 d(V n r' n ) 



(3) 



It follows that B(0,r) has the smallest possible NSM 



G(B(0,r)) 



a 2 (B(0,r)) 



(4) 



V(B(0,r)) n + 2 

which approaches l/(27re) from above as n — > oo. 

Thus, the NSM of any lattice in any dimension satisfies 
G(A) > l/(27re). 

A sequence of lattices A^™) with growing dimension is 
called good for mean squared error (MSE) quantization if 



lim g(a^) = 



We define the effective radius r e ff(A) as the radius of a ball 
which has the same volume as A, i.e., 



' eff 



(A) 4 



V(A) 



2/i. 



v, 



2/r, 



(5) 



Since B(0, r e ff(A)) has the smallest second moment of all sets 
in K n with volume T^(A), we have 



f (A) 



a 2 (B(0,r eff (A))) = ^^<^(A). 

n + 2 



Thus. 



reff(A) < V(n + 2)<7 2 (A). 



(6) 



Note that for large n we have 

- 2 f (A) V(A) 2 / n 



2ire 



A lattice A c is said to be nested in Af if A c C A/. The 
lattice A c is referred to as the coarse lattice and Af as the fine 
lattice. The nesting ratio is defined as {V{A c )/V{Af)) 1 ^ n . 

Definition 2: The operation of coset nearest neighbor de- 
coding with respect to the pair of nested lattices A c C A/ is 
defined by 

g(Y)= [Q A/ 00] modA c . 

In words, coset nearest neighbor decoding refers to finding the 
closest lattice point in Af and identifying its coset leader by 
reducing modulo A c . 

Definition 3: We say that a sequence in n of random 
noise vectors Z'™) of length n with (finite) effective variance 
cr| = iE||Z(")|| 2 , is semi norm-ergodic if for any e > 0, 
S > and n large enough 



Pr fzW $ B(0, + 5)na 



< e. 



(7) 



Note that by the law of large numbers, any i.i.d. noise is 
semi norm-ergodic. However, even for non i.i.d. noise, the 
requirement (O is not very restrictive. In the sequel we omit 
the dimension index, and denote the sequence Z^ n > simply by 
Z. 

Next, we define "good" pairs of nested lattices. Our defi- 
nition for the "goodness" of nested lattices pairs is different 
from the one used in 0. 



Definition 4: A sequence of pairs of nested lattices 
A ( c n) C is called "good" if: 

1) The sequence of coarse lattices A c is good for MSE 
quantization. 

2) For any lattice point t € ^/"^ an d additive semi norm- 
ergodic noise Z with effective variance <r| = iE||Z|| 2 



where 



lim Pr f [q. ( „, (t + Z)l mod ^ t mod A("'l = 0, 
provided tha{] 

lim WA ( f n) )t > 2neal. 

n— s-oo 

That is, the error probability under coset nearest neighbor 
decoding in the presence of semi norm-ergodic additive 
noise Z vanishes with n if r e g (A^™' ) > yjna\ . 

A. 77ze mod- A Transmission Scheme 
Consider the AWGN channel 

Y = X + N, 

where N ~ AT(0, 1) and AT is subject to the average power 
constraint K(X 2 ) < SNR. We now briefly recall the nested 
lattices coding scheme proposed in [2] for communication over 
this channel. 

A pair of nested lattices A c C A/ is used in order to 
construct the codebook C — Af n V c with rat^l 

R=-hg ( 777x4 ) = log(nesting ratio). 

Each of the 2 nR messages is mapped to a codeword in C. 
Assume the transmitter wants to send the message w which 
corresponds to the codeword t € C. It transmits 

X = [t - U] mod A c , 

where U is a random dither that is uniformly distributed over 
V c and is statistically independent of t, and it is common 
randomness known to both the transmitter and the receiver. 
Due to the Crypto Lemma (2] Lemma 1], X is also uniformly 
distributed over V and is statistically independent of t. Thus, 
the average transmission power is i/nEHXH 2 = er 2 (A c ). In the 
sequel we assume that A c is scaled such that <r 2 (A c ) = SNR, 
which satisfies the power constraint. 

The receiver scales its observation by a factor a > to be 
specified later, adds back the dither U and reduces the result 
modulo the coarse lattice 

Y eff = [aY + U] mod A c 

= [X + U + (a - 1)X + aN] mod A c 
= [t + (a - 1)X + aN] mod A c 
= [t + Z eff ] mod A c , 



(8) 



'in (2j the volume-to-noise ratio (VNR) was defined as 



fj, = y(A^ n) )-/27reo-|. 

Thus, the condition V(A^)™ > 27recr| is equivalent to VNR > 1. 

2 A11 logarithms in this paper are to the base 2, and therefore all rates are 
expressed in bits per (real) channel use. 



Zeff = (a- 1)X + aN 



(9) 



is effective noise statistically independent of t with effective 
variance 



<7^ ff 4_ E || Zi 



eff 



(1 -a) 2 SNR. 



(10) 



Thus, using nested lattice codes, the original AWGN channel 
is transformed to an effective modulo-additive channel whose 
output is a lattice point from A f corrupted by effective noise 
statistically independent of it. 

It was shown in Q, that there exist sequences of 
pairs of nested lattices A^ C Ay™' that achieve a good 
error exponent with coset nearest neighbor decoding using 
this transformation. As a corollary, it followed that such pairs 
of nested lattices achieve any rate below the capacity of the 
AWGN channel C = 1/2 log (1 + SNR). 

Here, we prove a weaker result: we only prove the existence 



of sequences of pairs Ac C A^ that allow to achieve 



vanishing error probability for any rate below the capacity 
of the AWGN channel with coset nearest neighbor decoding. 
Namely, we show that there exist sequences of pairs that are 
"good" according to Definition 2J and that the "goodness" 
conditions from Definition [4] suffice to achieve any rate below 
capacity. Although these "goodness" conditions do not suffice 
for achieving good error exponents, using them instead of 
those consider in 0, gives rise to simpler proofs, which is 
the aim of this paper. 

Note that since the coarse lattice is normalized such that 



<T 2 (A { c n) ) = SNR, if Ar ; is good for MSE quantization than 
±\ogV (A^ ,l) ) -> ilog(27reSNR) as n-too. In the next 
section we show that for any lattice density in the range 

ilog(27rea 2 ff ) < ±-\ogV(A ( f n} ) < i log(27reSNR) 

there exists a sequence of "good" pairs of nested lattices. We 
also show that for such sequences of pairs, Z e ff is semi norm- 
ergodic. Therefore, any rate satisfying 



» 



R < lim 

tl— >cx 

1 



-logl/(A(" ) )--logT/(A ( f ™ ) ) 
n n 1 



< -log(27reSNR) 
1 / SNR N 



1 



log (27recr 2 ff ) 



(11) 



is achievable using nested lattice codes. Setting 
a = SNR/(1 + SNR) in ( [Tol l, which is the linear minimum 
MSE coefficient for estimating X from Y, we see that any 
rate below C = y2log(l + SNR) is achievable with nested 
lattice codes and coset nearest neighbor decoding. 

III. Existence Proof 

Previous proofs for the existence of capacity achieving pairs 
of nested lattices used random Construction A, introduced by 
Loeliger Q, for creating a fine lattice, and then rotated it using 



a lattice that is good for MSE quantization]^ Here, we take 
a slightly different approach that is a direct extension of the 
original approach of J4[ to creating nested binary linear codes. 
We use random Construction A to simultaneously create both 
the fine and the coarse lattice. Namely, we randomly draw 
a linear code and lift it to the Euclidean space in order to 
obtain the fine lattice. The coarse lattice is obtained by lifting 
a subcode from the same linear code to the Euclidean space. 
The ensemble of nested lattice pairs is defined as follows. 

Let 7 = 2V nSNR, and p — £na, where £ is chosen as the 
largest number in the interval [1/2,1) such that p is prime. 
For some e% > 0, to be specified later, choose fci such thafl 



h. l 

— logp = - log 
n 2 



Vn 



2/h 



ex- 



(12) 



1) Draw a generating k x n (k > k{), matrix Gf whose 
entries are i.i.d. and uniform over the elements of the 
prime field Z. p = {0, 1, • • • ,p — 1}. Let G c be a ki x n 
matrix whose rows are the first k\ rows of Gf. 

2) Define the discrete codebooks 

C c = {x € Z™ : x = [w T G c ] mod p weZj 1 } 
and 

C/ = {xeZ£ : x = [w T G/] mod p weZj}. 

3) Apply Construction A to lift C c and Cf to R™ and form 
the lattices 

A c = 1V - X C C + 7 Z n , A/ = + jZ n . 

Remark 1: We have chosen to specify our ensemble in 
terms of the linear codes' generating matrices 



G' 



We could have equally defined the ensemble using the linear 
codes' parity check matrices 



H c 



H 



/ 



H' 



as done in 0], (8) for ensembles of nested binary linear codes. 

Clearly, in this construction A c C A/. Moreover, if the 
linear code's generating matrix G/ is full-rank over Z p , 
which by construction implies that G c is also full-rank, we 
have V(A C ) = 7 >~' £l and V(A f ) = j n p- k . Thus, the 
nesting ratio is p~^ and the rate of the induced codebook 
C = Af n V c is therefore R = log p. The probability that 
Gf is not full-rank can be bounded [6| by 

Pr(rank(G/) < k) <p k ~ n , 

3 More precisely, the coarse lattice in the previous proofs was Rogers good, 
which also implies goodness for MSE quantization. 

4 For ease of exposition we disregard the fact that by this definition k\ may 
not take an integer value. We can always round k\ to the nearest larger integer 
with no effect on the derivations in the sequel. 



which vanishes as long as k < (3n for some < j3 < 1. 
We take k to satisfy this restriction, which ensures that G c 
is full-rank with high probability. Thus, in the sequel we will 
assume that G c is indeed full-rank. Otherwise, the obtained 
nested lattices pair is regarded as "bad". 

Next, we show that almost all members of the ensemble 
satisfy the first "goodness" condition from Definition @] and 
that almost all members of the ensemble satisfy the second 
condition of Definition |4] It follows that almost all pairs 
of nested lattices in the ensemble satisfy both conditions 
simultaneously, and are therefore "good". We then show that 
for coarse lattices A c which are good for MSE quantization, a 
random dither uniformly distributed over the Voronoi region is 
semi norm-ergodic. This in turn implies that for such lattices, 
the effective noise Z e ff from the mod-A transmission scheme 
is also semi-spherical. Finally, we conclude that with "good" 
pairs of nested lattices, the capacity of the AWGN channel 
can be achieved using the mod-A transmission scheme. 

Before going into the proofs we need to introduce some 
more notation. Let CUBE = [-1/2,1/2)" denote the unit 
cube centered at zero. Also, denote the operation of reducing 
each component of x e R n modulo 7 by x* = [x] mod j*E n . 
If S is a set of points in R™, S* is the set obtained by reducing 
all points in S modulo 7Z™. If S and T are sets, S + T is 
their Minkowski sum. In the sequel, we use the following 
lemma, which follows from simple geometric arguments and 
is illustr ated in Fig ure [TL 




Fig. 1. An illustration of Lemma [T] The solid circle is the boundary of 
B(s, r), and the points inside the small bright circles are the members of the 
set Z n n B(s, r). The set = Z n n S(s, r) + CUBE is the shaded area, 
and as the lemma indicates, it contains B(s, r — -^-) and is contained in 
S(s, 



-^), whose boundaries are plotted in dashed circles. 



Lemma 1: For any s £ R™ and r > the number of points 
of Z™ inside B(s,r) can be bounded as 



2 ' v - 



V.°M K < |Z"n#(s,r)| < 



Proof: Let S = {Z n n B(s, r)) + CUBE, and note that 
|Z" n B(s, r)| = Vol(«S). We have 



(13) 



B(s,r + ^-). 



5 C B(s, r) + CUBE C B(s, r) + B(0, ^ 
Thus, 

Vol \B \b, r - j < Vol (S) < Vol (b (s,r + ^ 



A. Goodness for MSE Quantization 

In this subsection we prove the following lemma, which 
follows from rate-distortion considerations. 

Lemma 2: For any 5\ > and n large enough, almost all 
lattices A c in the defined Construction A ensemble satisfy 

1 



G(A C ) < + 



2ire 



Proof: We begin by bounding the average MSE distortion 
the ensemble achieves, and then we connect it to the normal- 
ized second moment. For any x G R", define 

1 



d(x, A c ) = — min llx — All 

n agA c 

1 • II 
= — mm x - 

n aez»,cec c 



"fp c — 7a|| 2 



= — min 1 1 (x — jp 1 c)*|| 2 . 
n cgC c 

Note that d(x, A c ) < 7 2 /4 for any x G R", regardless of C c . 

For any w G Z^ 1 \ 0, define the random vector 
C(w) = [w T G c ] mod p, and note that C(w) is uniformly 
distributed over Z™. For all we^'\0 and x G W\ we 
have 

e = Pr fin (x - 7 p- 1 C(w))* || 2 < SNR 

(jp~ 1 Z™) P| B*(x, V nSNR 
(7P _1 Z n )p|B(x, VnSNR) 



pi 



y n (7" 2 nSNR)t 1 



2pVSNR 



= V n 



(14) 
(15) 

(16) 



where ( fT4T > follows since 7 = 2VnSNR, and hence, for any 
two distinct points t>i,b 2 G B(x, %/nSNR) we have h\ 7^ b 2 
(that is, the ball B(x, V nSNR) is contained in a cube with side 
7), and (TT3T > follows from Lemma [TJ Substituting p = (n? 
gives 



To see this, note that any x G B (s, r — ^-^j lies inside 
a + CUBE for some a G Z n , and for this a the inequality 
||a — x|| < y / n/2 holds. Applying the triangle inequality gives 

||a - s|| = || (a - x) + (x - B )|| < || (a - x)|| + ||(x - s)|| < r, 

which implies (fT~3b . On the other hand, 



e > V n 2~ n 1 - 



> V n 2- n 1 - - 
n 



> ~^V n 2~ 



(17) 



where O holds for all n > 4. Let M = p kl - 1. Label 
each of the vectors w G Z^ 1 \ by an index i = 1 , . . . , M, 
and refer to its corresponding codeword as Cj. Define the 
indicator random variable related to the point x G K n 



fl if i|| (x- 7 p- 1 C 4 



< SNR 



otherwise 



Since each Xi occurs with probability e, we have 



,M v / M 

v i=l \ i=l 

1 M 



£ = — £ 




> £ 



(18) 



where the last inequality follows from Chebyshev's inequality. 
In order to further bound the variance term from ( fT8l . we note 
that C(wi) and C(w2) are statistically independent unless 
wi = [0W2] mod p for some a G Z p . Therefore, each Xi is 
statistically independent of all but p different x/ s - Thus, 



Var 



' j M \ ^ M M 



Cov(xi,Xi) 



< 



Mp£ 



Substituting into ( fT8l and using ( TPTl i. we see that for any 

x G 1" 



Pr (d(x, Ac) > SNR) = Pr ^ Xi = 



< 



Me 



< ii! ■ 



< 2n'p- fcl 2 n V r n _1 . (19) 



It follows that for any distribution on X we have 

Ex,A c (d(X,Ac)) 

< SNRPr(d(X,A c ) < SNR) + Pr (d(X, A c ) > SNR) 

< SNR(l + 2n7p- fe iv;r 1 2 n ) 

^SNR(l + 2nl2-"[- l0gP - l0S (^)]). 

Our choice of fci, as given in ( fT2b . ensures that the upper bound 
on the distortion averaged over X and over the ensemble 
of coarse lattices A c becomes arbitrary close to SNR as n 
increases. Since this is true for all distributions on X, we 
may take X ~ Unif (j[0, 1)")- Let U be a random variable 
uniformly distributed over the Voronoi region V c of a lattice A c 
randomly drawn from the ensemble. By construction, for any 
lattice A c in the defined ensemble [jp~ 1 C c + V c ] = 7[0, l) n . 
Moreover, reducing the set r yp~ 1 C c + V c modulo 7Z" does not 
change its volume. Therefore 



E Ac (a 2 (A c )) = Eu, Ac ( -||U|| 2 ) = E X ,A C (d(X,A c )) . 



It follows that 



lim E Ac (c 2 (A c )) < SNR. 



n— ^00 

The normalized volume of a fundamental cell of A c is lower 
bounded by 

V{K c )i > ( 7 >- fe1 )* 
= 4nSNRp~ 2 ^ 

= 2- £ W#SNR, (20) 
with equality if and only if G c is full-rank. This implies that 

lim E Ac (g(A^)) = lim E Ac 



<r 2 (A e ) 
V(A C ) 

<Hm EU^AJ) 
n ^°° 2- £ inK"SNR 
= 2 ei lim -Vn" 

2?re 

The normalized second moment of any set of points 
in MP cannot be smaller than that of an n-dimensional 
ball, which approaches l/(2ne) as n increases. Therefore, 
for all lattices in the ensemble G(A C ) > l/2we. Applying 
Markov's inequality for the non-negative random variable 
T(A C ) = G(A C ) - l/(27re) shows that for any m > 1 at least 
a fraction of (m — l)/m of the sequences of lattices in the 
ensemble satisfy 

lim G(Ai n) ) < (1 + m(2 £l - 1)) 
n— ¥00 zire 



Choosing e\ = log(l + 8 1 /{2m)) gives 

lim G(AW)<(1 + V2)^-, 

for at least a fraction of (m — l)/m of the sequences of lattices 
in the ensemble. Taking m as large as desired establishes the 
lemma. 

Note that for m > 21og(e) we have e\ < Si. We will use 
this fact in Subsection IIII-DI ■ 

B. Goodness for Coding Under Nearest Neighbor Coset De- 
coding 

The next lemma shows that most nested lattice pairs in 
the ensemble achieve arbitrary low error probabilities in the 
presence of additive semi norm-ergodic noise as long as their 
point density is not too high. 

Lemma 3: Let Z be an additive semi norm-ergodic noise 
with effective variance er| = iE||Z|| 2 . For any t € A^, e > 0, 
S > and n large enough, for almost all lattices in the 
ensemble, the error probability under coset nearest neighbor 
decoding is bounded by 

P e = Pr ( [Q Af (t + Z)] mod A c ^ t mod A c ) < e, 



provided that 



V(A f ) 



2/n 



27recr| 



> 1 + 5. 



An error event under coset nearest neighbor decoding has 
two possible sources. Either the additive noise Z is too 
large, which rarely happens since it is semi norm-ergodic, 
or there are competing codewords which are "too close" to 
the transmitted codeword t. Without loss of generality, we 
may assume that the zero codeword was transmitted. Before 
proving Lemma [3] we would first like to upper bound the 
probability that a competing codeword (i.e., a codeword that 
does not belong to the coset of the zero codeword A c ) falls 
inside a ball centered at the origin. Rather than computing 
this probability directly, we upper bound the probability of 
a larger event, namely the event that a lattice point from 
Af \ 7Z™ (rather than Af\ A c ) falls inside a ball centered at 
zero. This bound is given in the following proposition, whose 
proof follows from straightforward volume considerations. 
Proposition 1: Let Z be some n-dimensional random 
(21) vector with effective variance ct| = iE||Z|| 2 and let 
rz = 1/(1 + p)ncr\ for some p > 0. Then for any e > 0, 
p > and n large enough, for almost all lattices in the 
ensemble 

Pr((A / \ 7 Z")f|B(Z,r z )^0) < e, 
provided that 

V{A f f/ n 
2ne(l + p)a\ > + P ' 

Proof of Proposition Q} We show that the average 
probability over the ensemble vanishes with n, and therefore 



for almost all members of the ensemble this probability is 
small. Let 1(A) be the indicator function of the event A. 



E A/ (Pr((A / \ 7 Z")f)6(Z,r z )^0)) 
= E A/ E Z (l ((7P _1 C/ \ 0) f|B*(Z,r z ) + 
= E Z E A/ (l(( 7 p- 1 C / \O)f|S*(Z ! r z )^0 
= E z Pr(( 7 p- 1 C / \0)f|S*(Z,r z ) ^0 | Z 



A/ 
Z 



(22) 



Since each codeword in Cj \ is uniformly distributed over 
Z™, and there are less than p k such codewords (i.e., p k — 1), 
applying the union bound gives 

Pr((A / \ 7 Z»)f|S(Z,r z )^( 
Tp-^flBCZ.rz) 



k—n 



' E z [ jr 
< E z ( 




(23) 



^ K((i + pW1)" /2 c 

r y n p 

= { nV* /n (l+p)a z \ 
< ^27r e (l + p)a^" 2 



i/2 



where ( f23l follows from Lemma Q] and (l24t from the fact that 

2 /n 

nKn' < 2ire. If G/ is full-rank, as is the case for almost all 
lattices in the ensemble, we have 

E A , (Pr((A/\ 7 Z")flB(Z ! r Z )^0)) 

i/2 / 4SNR 



/ 27re(l+p)a| 



< (i + p)-T e V , 

which vanishes for large n. ■ 
Proof of Lemma\3} We upper bound the error probability 
of the nearest neighbor coset decoder using the bounded 
distance coset decoder, which is inferior. More precisely, we 
analyze the performance of a decoder that finds all lattice 
points of Af within an Euclidean distance r from t + Z, 
and reduces them modulo the coarse lattice A c . Namely, the 
decoder's output is the set [A/ f] B(t + Z, r)] mod A c . Note 
that all points in this set are codewords in C = Af n V c . If 
there is a unique codeword in this set, this is the decoded 
codeword. Otherwise, the decoder declares an error. It is 



easy to see that regardless of the choice of r, the nearest 
neighbor coset decoder makes the correct decision whenever 
the bounded distance coset decoder does. Therefore, the error 
probability of the nearest neighbor coset decoder is upper 
bounded by that of the bounded distance coset decoder. 

The bounded distance coset decoder makes an error only if 
at least one of the events 



or 



tgB(t + Z,r), 



(A / \(t + A c ))r|B(t + Z ) r)^ 



(25) 



(26) 



occurs. The event in d25l > is equivalent to Z <^ B(0, r), whereas 
the event in (f26]) is equivalent to (A/ \ A c ) f]B(Z, r) ^ 
which is included in the event (A/ \ 7Z") f| B(Z, r) ^ 0. 

Therefore, for any r > 0, we may apply the union bound, 
which gives 

P e < Pr (Z i 6(0, r)) + Pr ((A, \ 7 Z B ) f| B(Z, r) ^ 0) . 



Let p = + 5 - 1 > and note that 



27recr| 



> 1 + 5 



implies 



V(Af) 2 / n 
2ne(l + p)a 2 z > + P ' 



Thus, setting r = rz = y/JT+ p)na 2 ,, we have 

P e <Pr(Z^B(0,r z )) 

+ Pr((A / \ 7 Z")f|B(Z,r z )^0). (27) 

The first summand in ( 1271 ) can be made smaller than e/2 for 
n large enough, since Z is semi norm-ergodic. The second 
summand is smaller than e/2 for almost all members in the 
ensemble, by Proposition Q] It follows that for almost all 
members of the ensemble P e < e. ■ 

C. Mixture Noise Is Semi Norm-Ergodic 

Our aim is to show that a mixture noise composed of AWGN 
and a dither from a lattice that is good for MSE quantization, 
as appears in the mod- A channel ([8]), is semi norm-ergodic. 
First, we show that if A c is good for MSE quantization, i.e., 
if its normalized second moment is close to l/2we, then a 
random dither uniformly distributed over V c is semi norm- 
ergodic. To that end, we first prove the following lemma, 
which is a simple extension of (51 . 

Lemma 4: Let S G M" be a set of points with volume V{S) 
and normalized second moment 



G{S) 



1 r s ||x|| 2 dx 
nV(S) V(S)i ' 



Let r e ft be the radius of an n-dimensional ball with the same 
volume as V(S), i.e., V(S) — Kjr™ ff . For any < e < 1 
define 



a /27reG(5)-^(l-e)^ 



-r e ff. 



The probability that a random variable U ~ Unif (S) leaves a 
ball with radius r £ is upper bounded by 

Pr(U £ B{0,r £ )) < e. 

Proof: Let f e be the radius of a ball that contains exactly 
a fraction of 1 — e of the volume of S, i.e., 

Vol(<Sf|B(0,f e )) = {l-e)V(S). 

Clearly, Pr (U ^ B(0, f e )) = e. In order to establish the 
lemma we have to show that r c < r c . To that end, we write 

nG{S)V{S)r ' 



V{S) Jxes 
1 



\ 2 dx 



V(S) 



|x|| 2 dx 



e(sns(o,f e )) 



\\x\rdx . (28) 

te(Sn(R"\e(o,f e ) / 

The first integral in d28l may be lower bounded by replacing 
its integration boundaries with an n-dimensional ball B(0, p e ), 
where 



p\ = V n 5 (l-e)-nS)» 
is chosen such that V n p r e l = (1 — e)V(S). Thus 



(29) 



e(Sne(o,f e ) 



|x|| 2 dx> / llx|| 2 d> 



xee(o, Pe ) 
nV nP n t a 2 (B(0,p e )) 



nS) 1+ £(l-e) 1+ s 



(30) 



^(5)(l-e) 1+ ^r e 2 ft , (31) 



_ ra + 2 
~ n + 2 

where we have used ([3]l to get (130) . The second integral in ( 
is over a set of points with volume eV(S) which are all at 
distance greater than f € from the origin. Therefore, it can be 
bounded as 



|x|| 2 dx > eV(S)r 



'xe(Sn(R"\e(o,f e ))) 
Substituting (|3B and (O into (|28j gives 



nG(S)V(S)« > 



n + 2 



(32) 



(33) 



Using the fact that V(«S)» = K"r 2 ff , (O reduces to 



r 2 < 



< 



nV n -G{S)-^{l-ef 



as desired. ■ 
We would like to apply Lemma [4] to a sequence of random 



variables uniformly distributed over the Voronoi regions Vc 
of a sequence of lattices Ac . From ©, we have 



(n) 



»'eff 



(a^) < ^ + 2y(A(")), 



(34) 



If the sequence of lattices is good for MSE quantization, 
then for any 6\ > and n large enough 



G(AW)<(1 + ^)^. 



(35) 



Combining d34l l and d35l >. it follows that if A c ™^ is good for 
MSE quantization, for any 8% > and n large enough r e of 
Lemma |4] satisfies 



Si 



r e < (n + 2) ( r2(Ai" ) ). 



(36) 



For any 5 > and e > we may choose <5i = (5e/2, which 
gives the following proposition. 

Proposition 2: If the sequence of lattices Ac is good 
for MSE quantization, then the sequence of random vectors 
U ~ Unif(Vc™' ) ) is semi norm-ergodic. 

The next lemma states that any linear combination of semi 
norm-ergodic noise and a dither from a lattice which is good 
for MSE quantization is itself semi norm-ergodic. 

Lemma 5: Let Z = aN + /3U, where a, (3 £ R, N is semi 
norm-ergodic noise with effective variance er 2 ^ = iE||N|| 2 , 
and U is a dither statistically independent of N with effective 
variance a 2 , — iE||U|| 2 , uniformly distributed over the 
Voronoi region V of a lattice A which is good for MSE 
quantization. Then, the sequence of random vectors Z is semi 
norm-ergodic. 

Proof: Since N and U are statistically independent, the 
effective variance of Z is 



a 2 z = -E||Z|| 2 = a 2 ( r 2 J +/3 2 t T 2 J . 
n 



We have to prove that for any e > 0, 5 > and n large enough 



' eff 



' eff' 



Pr Z 



< e. 



For any e > 0, S > and n large enough we have 



Pr \Z$B{Q,yJ(l + 6)no%) 

= Pr(||Z|| 2 > (1 + 5)^1) 
= Pr(||N|| 2 > (l+<y)nof,) 

•Pr(||Z|| 2 >(l + <5)m7 2 | ||N|| 2 > (1 + 5)^) 
+ Pr(||N|| 2 < (l + 8)na^) 

•Pr(||Z|| 2 >(l + ,S)na 2 | ||N|| 2 < (1 + 5)^) 

< |+Pr (V||U|| 2 + 2a/?N T U 

> (1 + 5)np 2 alj I ||N|| 2 < (1 + S)na^ 



<-+Pr(/3 2 ||U|| 2 >n/3 2 ( T 2 J (l + ( 5/2)) 

+ Pr (2a/3N T U > n^ 2 a 2 J 6/2 I ||N|| 2 < (1 + S)r 



Let Z be AWGN with zero mean and variance r\jjn. Using 
a similar approach to that taken in [2, Lemma 11], we would 
now like to upper bound the pdf of U using that of Z. For 
any x € M ra we have 



/ G (x) _ / & (||x||) < / e (ru) 



2?re 



nV n 



2/ 1 



(37) 



'Ni 



(38) 



< + Pr (2a/3N T U > n0*o$jS/2 | ||N|| 2 < (1 + S)na^) 

(39) 

where (f37T > follows from the fact that N is semi norm- 
ergodic, (l38l from the union bound and d39l from the fact 
that U is semi norm-ergodic due to Proposition |2] We are left 
with the task of showing that the last probability in ( |39l can 
be made smaller than e/3 for n large enough. This requires 
some more work. 

Since U is semi norm-ergodic noise, than for any e 2 > 0, 
<5 2 > and n large enough 



/z(x) / a (||x||) " /g(ru) 
Thus, for any x £ M. n 

/ {j (x)<2tM 2 ?^ 2/ ") / . (x) . 

We can further bound d40b for large enough n as 

Pr(N T U > r |N = n) 

< e 2 + 2f iog((i + 25 2 )^^- 2 /") Pr(n T2 > r) 

= e2 + 2 fio g ((i + 2, 2 )^-v« )g /^r L \ 

Vllnlku/ 

where Q(-) is the standard Q-function, which satisfies 
Q(x) < e~ x2/2 . It follows that 

Pr (2q/3N t U > n/3 2 (T^(5/2 | ||N|| 2 < (1 + S)ncr^) 

< £2 

V^/3ctu<5/2 \ 



+ 2 f log((l+2,5 2 )^y- 2 /")g 



Pr ||U||> V(1 + *2K <e 2 



Let r\j — + S2)na 2 J , and /u(u) be the probability 

density function (pdf) of U. For any r > we have 

Pr (N T U > r |N = n) = / /u(u)l(n T u > r)du 
+ f /u(u)l(n T u>r)du 

J\u\>r u 



< 



1 



|u|<r u V(A) 

V(B(0,r v )) 



l\u\>r 
l(n T u > r)du + e 2 

1 



V(A) J H < r „ V(B(0,r v )) 



l(n T u > r)du + e 2 . 



Using the fact that A is good for MSE quantization we have 
V(K) 2 / n —} 27recr 2 J , and hence, for n large enough, 



/ y(g(0,ru)) 
I V(A) 



< (l + 25 2 



Taking <5 2 sufficiently smaller than 5 and e 2 < e/6, for n large 
enough we have 

Pr (2a/3N T U > n/3 2 a 2 J S/2 \ ||N|| 2 < (1 + S)na^) < ^. 



The next two corollaries are simple consequences of 
Lemma[5] The first follows since any i.i.d. noise is semi norm- 
ergodic, and the second follows by iterating over Lemma [5] 

Corollary 1: Let N be i.i.d. random vector and U be a 
dither random vector that is statistically independent of N 
and is uniformly distributed over the Voronoi region V of a 
lattice A that is good for MSE quantization. Then, any linear 
combination of the form Z = aN + (3JJ, where a, f3 6 R, is 
semi norm-ergodic. 

Corollary 2: Let Ui, • • • ,\Jk be statistically independent 
dither random vectors, each uniformly distributed over the 
Voronoi region 14 of A&, k = 1,...,K, that are all 
good for MSE quantization. Let N be an i.i.d. random 
vector statistically independent of {Ui,-- - ,TJk}- For any 
a,Pi, - ■ ■ , Pk G R the random vector Z = aN+^]f =1 /3fcUfe 
is semi norm-ergodic. 



Let U be a random vector uniformly distributed over B(0, r\j). 
We have 

Pr (N T U > r |N = n) < e 2 + (1 + 2<5 2 )* Pr(n T U > r). 

(40) 



D. Tying It All Together 

First, we combine Lemma |2] and Lemma [5] to obtain an 
existence theorem for nested lattice pairs which are "good" in 
the sense of Definition |U Note that in the considered lattice 



construction, the normalized volume of the coarse lattice is 
given by 



ilogV(A c ) = ilog( 7 >- fei ; 
n n 



log( 7 ) - — logp 
n 



log 



2/n 



ei 



-log(27reSNR) - e x . 



(41) 



Using the parameter k > fci in the definition of 
the nested lattices ensemble, we can set the normal- 
ized volume of the fine lattice to any desired value of 
i logy (A/) < \ log(27reSNR) - t x . Since ft from Lemma|2] 
is larger than e%, this means that we can set the normal- 
ized volume of the fine lattice to any desired value of 
ilogV(Ay) < ilog(27reSNR) - ft. Thus, we have the fol- 
lowing theorem. 

Theorem 1: For any <t| > 0, e > 0, 5 > and 

hog(2ireoi)+8 < ilogV(A/) < \ log(27r e SNR) - 5, 

for n large enough, there exists a pair of nested lattices 
A c C A/ such that 

1) a 2 (A c ) = SNR and G(A C ) < ±±f . 

2) For any lattice point t e A/, and additive semi norm- 
ergodic noise Z with effective variance a\ = iE||Z|| 2 



Pr ( [Q Af (t + Z)] mod A c ^ t mod A c 



< e. 



We now apply Theorem Q] to show that nested lattice codes 
achieve the capacity of the AWGN channel using the mod- 
A transmission scheme with coset nearest neighbor decoding. 
Theorem [T] states that nested lattice codes achieve arbitrary 
low error probabilities over channels of the form Y = t + Z, 
where Z is semi norm-ergodic additive noise. However, the 
output of the equivalent channel when applying the mod-A 
transmission scheme is not exactly of this form. Rather, it is 
given by 

Y e ff = [t + Z eff ] mod A c 

= t — Q c (t + Z e ff) + Z e ff. 

Thus, it is equivalent to an additive channel with input 
t — Q c (t + Z e ff). Nevertheless, as 

[t - Q c (t + Zeff)] mod A c = t, 

the coset nearest neighbor decoder cannot distinguish between 
the inputs t and t — Q c (t + Z e ff). Therefore, when the 
coset nearest neighbor decoder is used, the output of the 
induced channel is equivalent to t + Z e ff. See Figure |2] for an 
illustration of the nearest neighbor coset decoder's operation. 

The effective noise Z e ff is a linear combination of an AWGN 
and a dither uniformly distributed over V c . When the coarse 
lattice A c is good for MSE quantization, Z e ff is semi norm- 
ergodic by Corollary Q] 

It follows that for "good" pairs of nested lattices, the 
error probability in coset nearest neighbor decoding of Y e ft, 




Fig. 2. An illustration of the coset nearest neighbor decoding process. The 
lattice point t was transmitted. The output of the induced channel when the 
mod-A transmission scheme is applied is Y c ff = [t + Z c ff] mod A c . The 
decoder quantizes Y c ff to the nearest lattice point in A/ and reduces the 
quantized output modulo A c . In the figure the coarse lattice A c is the integer 
lattice. 



the output of the equivalent channel induced by the mod-A 
scheme, can be made arbitrary small for all 



R < - lor 



eff 

where cr^s ^ s tne effective variance of Z e ff. 

Remark 2: Note that we have not used the Gaussianity of 
N in the proof. Therefore, any rate below 1 /2log(SNR/(Tg ff ) 
is in fact achievable using nested lattice codes and coset 
nearest neighbor decoding for any additive i.i.d. noise N, or 
more generally, any additive semi norm-ergodic noise. This 
is analogous to the results of ifTOl where it is shown that 
1 /2log(l + SNR) is the highest rate a coding scheme that 
utilizes Gaussian codebooks and nearest neighbor decoding 
can achieve over an additive ergodic noise channel. 

Remark 3: Note that for unshaped transmission, i.e., where 
the coarse lattice is the cubic lattice, a random dither U 
uniformly distributed over its Voronoi region is an i.i.d. ran- 
dom vector. Hence, a linear combination of such a dither and 
AWGN is semi norm-ergodic. It follows that the effective noise 
in the mod-A transmission scheme is semi norm-ergodic even 
when no shaping is used. If the fine lattice is good for coding 
in the presence of additive semi norm-ergodic noise, which as 
Lemma [3] indicates, is the case for almost all Construction A 
lattices, then any rate satisfying 



R< -log 



SNR 



'eff 



/2ne\ 



— lOR , i 

2 S V 12 J 



(42) 



is achievable with a one-dimensional coarse lattice. 



IV. Extensions 
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During the last decade, nested lattice codes have been 
extensively used in the literature in the context of Gaussian 
network problems, see e.g. iTTTI — [fT3Tl . For such problems, 
a pair of nested lattices is often not sufficient, and more 
complicated chains of nested lattices are required. A major 
advantage of the nested lattice ensemble we have used in 
Section [Hi] for proving the existence of "good" pairs, is that 
it can be naturally extended to construct any (finite) chain of 
nested lattices. Specifically, we can draw a linear code with a 
generating matrix G of dimensions k x n over Z p , and then 
for k > kr, > ■ ■ ■ > k\ construct a sequence of nested linear 
codebooks C\ C • • • C Cl, by taking the generating matrix of 
the Ah codebook to be the first ke rows of G. These codewords 
can be lifted to the Euclidean space using Construction A in 
order to form a chain of nested lattices. 

In Lemma [2] we have shown that almost all random 
Construction A lattices are good for MSE quantization. In 
Lemma |3] we have shown that almost all random Construction 
A pairs of nested lattices achieve low error probabilities in 
the presence of additive semi norm-ergodic noise, under coset 
nearest neighbor decoding. It follows that for any finite number 
L and any ilogT^(A i ) < ■■■ < ilogV(Ai), there exists a 
chain of nested lattices A x C • ■ ■ C A^ such that all lattices 
are good for MSE quantization and all pairs within the chain 
achieve low error probabilities under coset nearest neighbor 
decoding in the presence of additive semi norm-ergodic noise. 

In certain applications, chains of nested lattice codes are 
used in order to convert a Gaussian multiple access channel 
(MAC) into an effective modulo-lattice channel whose output 
is a fine lattice point plus effective noise reduced modulo 
a coarse lattice. Such a situation arises for example in the 
compute-and-forward framework fl2l . where a receiver is 
interested in decoding linear combinations with integer valued 
coefficients of the codewords transmitted by the different 
users of the MAC. In such applications, the effective noise is 
often a linear combination of AWGN and multiple statistically 
independent dithers uniformly distributed over the Voronoi 
region of the coarse lattice. By Corollary [2] such an effective 
noise is semi norm-ergodic regardless of the number of dithers 
contributing to it, as long as they are all independent and are 
induced by lattices which are good for MSE quantization. 

In our proofs we took the cardinality p of the finite field 
over which we construct the linear codes to grow polynomially 
with n. While this facilitates the derivations, it does not seem 
to be necessary in practice when targeting a fixed (small) gap 
to capacity. Indeed, in lfl6l it was shown that construction A 
lattices with small values of p, such as 2, 3 and 5, achieve an 
NSM very close to l/(2?re). For additive noise channels with 
a PAM input constellation with prime cardinality, linear codes 
perform just as well as random codes. Therefore, it seems that 
for low /moderate values of SNR the ensemble of nested lattice 
codes we have used can achieve rates close to the capacity of 
the AWGN channel even with small values of p. 



The authors thank Bobak Nazer, Yair Yona and Ram Zamir 
for discussions that helped prompt this work. 
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